
PARQUET-2165: Remove deprecated PathGlobPattern class#979

Closed
steveloughran wants to merge 1 commit into apache:master from steveloughran:PARQUET-2165-PathGlobPattern

Conversation

@steveloughran

Contributor

Remove the deprecated classes PathGlobPattern and
DeprecatedFieldProjectionFilter so that Parquet will
compile against Hadoop 3.x.

If a Thrift reader is configured to use the now-deleted filter
by setting "parquet.thrift.column.filter",
a ThriftProjectionException will be thrown.
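
Since a job that still sets the removed key would only fail when the reader is set up, a pre-flight check on the job configuration may help during migration. The following is a minimal sketch, not part of this PR: it uses a plain `Map<String, String>` as a stand-in for the Hadoop configuration, the class and method names are hypothetical, the pattern value `name/*` is only a placeholder, and it throws `IllegalStateException` rather than Parquet's own exception. The only name taken from the description above is the key "parquet.thrift.column.filter".

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: detects the configuration key of the removed filter.
public class ThriftFilterConfigCheck {

    // Key named in the PR description for the now-deleted deprecated filter.
    static final String DEPRECATED_FILTER_KEY = "parquet.thrift.column.filter";

    /** Returns true if the configuration still sets a non-empty deprecated filter. */
    static boolean usesDeprecatedFilter(Map<String, String> conf) {
        String value = conf.get(DEPRECATED_FILTER_KEY);
        return value != null && !value.isEmpty();
    }

    /** Fails fast with a readable message instead of failing later in the reader. */
    static void requireNoDeprecatedFilter(Map<String, String> conf) {
        if (usesDeprecatedFilter(conf)) {
            throw new IllegalStateException(
                "Configuration sets " + DEPRECATED_FILTER_KEY
                    + "; the filter behind it has been removed, so migrate to the strict filter");
        }
    }

    public static void main(String[] args) {
        // Stand-in for a job configuration (assumption: plain key/value pairs).
        Map<String, String> conf = new HashMap<>();
        conf.put(DEPRECATED_FILTER_KEY, "name/*"); // placeholder pattern

        try {
            requireNoDeprecatedFilter(conf);
        } catch (IllegalStateException e) {
            System.out.println("Rejected: " + e.getMessage());
        }
    }
}
```

A check like this only reports the stale setting; translating the old patterns into the strict filter's syntax still has to be done by hand.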

Jira

Tests

  • My PR adds the following unit tests OR does not need testing for this extremely good reason:

It modifies the test TestParquetToThriftReadWriteAndProjection to switch to the strict filter in all test cases where the old one was being used.

* These tests now all fail with ThriftProjectionException: No columns have been selected

I could cut the tests as obsolete, but it seems to me that moving them to the strict filter would be better. I will need help doing this.

Commits

  • My commits all reference Jira issues in their subject lines. In addition, my commits follow the guidelines from "How to write a good git commit message":
    1. Subject is separated from body by a blank line
    2. Subject is limited to 50 characters (not including Jira issue reference)
    3. Subject does not end with a period
    4. Subject uses the imperative mood ("add", not "adding")
    5. Body wraps at 72 characters
    6. Body explains "what" and "why", not "how"

Documentation

  • In case of new functionality, my PR adds documentation that describes how to use it.
    • All the public functions and classes in the PR contain Javadoc that explains what they do

@Fokko

Fokko commented Oct 17, 2023

Contributor

@steveloughran is this something that you still want to get in?

@Fokko

Fokko commented Oct 17, 2023

Contributor

Seems to conflict with #1076

@steveloughran

Contributor Author

This needs to be disposed of in order to build against more recent Hadoop versions, so someone has to do it.

@github-actions

github-actions Bot commented Jul 1, 2026


This pull request has been automatically marked as stale because it has had no activity for at least 2 months. If you are still working on this change or plan to move it forward, please leave a comment or push a new commit so we know to keep it open. Otherwise, this PR will be closed automatically in about one month. Thank you for your contribution to Apache Parquet!

2 participants